Semi-automatic Ground Truth Generation for Chart Image Recognition

نویسندگان

  • Li Yang
  • Weihua Huang
  • Chew Lim Tan
چکیده

While research on scientific chart recognition is being carried out, there is no suitable standard that can be used to evaluate the overall performance of the chart recognition results. In this paper, a system for semi-automatic chart ground truth generation is introduced. Using the system, the user is able to extract multiple levels of ground truth data. The role of the user is to perform verification and correction and to input values where necessary. The system carries out automatic tasks such as text blocks detection and line detection etc. It can effectively reduce the time to generate ground truth data, comparing to full manual processing. We experimented the system using 115 images. The images and ground truth data generated are available to the public.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Generating Ground Truthed Dataset: Automatic or Semi-automatic?

Ground truthing tools mainly fall into two categories: automatic and semi-automatic. In this paper, we first discuss the pros and cons of the two approaches. We then report our own work on designing and implementing systems for generating chart image dataset and multilevel ground truth data. Both semi-automatic and automatic approaches were adopted, resulting in two independent systems. The dat...

متن کامل

Efficient Generation of Large Amounts of Training Data for Sign Language Recognition: A Semi-automatic Tool

We have developed a video hand segmentation tool which can help with generating hands ground truth from sign language image sequences. This tool may greatly facilitate research in the area of sign language recognition. In this tool, we offer a semi automatic scheme to assist with the localization of hand pixels, which is important for the purpose of recognition. A candidate hand generator is ap...

متن کامل

StrokeBank: Automating Personalized Chinese Handwriting Generation

Machine learning techniques have been successfully applied to Chinese character recognition; nonetheless, automatic generation of stylized Chinese handwriting remains a challenge. In this paper, we propose StrokeBank, a novel approach to automating personalized Chinese handwriting generation. We use a semi-supervised algorithm to construct a dictionary of component mappings from a small seeding...

متن کامل

Segmentation semi-automatique en plans pour la génération de cartes denses de disparités Semi-automatic Planar Segmentation Applied to the Generation of Dense Disparity Maps

This work falls under computer vision framework and more precisely planar segmentation applied to the generation of dense disparity maps. The goal is to produce new stereoscopic images with ground truth in order to evaluate and to compare precisely stereovision algorithms. We consider piecewise planar scenes and we propose a semi-automatic segmentation method based on the active contour models ...

متن کامل

A Ground Truth Tool for Synthetic Aperture Radar (SAR) Imagery

The performance of Computer Vision algorithms has made great strides and it is good enough to be useful in a number of civilian and military applications. Algorithm advancement in Automatic Target Recognition (ATR) in particular, has reached a critical point. State-of-the-art ATRs are capable of delivering robust performance for certain operational scenarios. As Computer Vision technology matur...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006